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Abstract — The focus of this work is on the analysis of transmit 
beamforming schemes with a low-rate feedback link in wireless 
sensor/relay networks, where nodes in the network need to 
implement beamforming in a distributed manner. Specifically, 
the problem of distributed phase alignment is considered, where 
neither the transmitters nor the receiver has perfect channel 
state information, but there is a low-rate feedback link from 
the receiver to the transmitters. In this setting, a framework 
is proposed for systematically analyzing the performance of 
distributed beamforming schemes. To illustrate the advantage 
of this framework, a simple adaptive distributed beamform- 
ing scheme that was recently proposed by Mudambai et al. 
is studied. Two important properties for the received signal 
magnitude function are derived. Using these properties and the 
systematic framework, it is shown that the adaptive distributed 
beamforming scheme converges both in probability and in mean. 
Furthermore, it is established that the time required for the 
adaptive scheme to converge in mean scales linearly with respect 
to the number of sensor/relay nodes. 

Index Terms — Array signal processing, convergence of nu- 
merical methods, detectors, distributed algorithms, feedback 
communication, networks, relays. 



I. Introduction 

The problem of distributed beamforming arises quite natu- 
rally in wireless sensor/relay networks. In a sensor network, 
sensors make estimates of a common observed phenomenon 
and reach a consensus using a local message passing algo- 
rithm. In a relay network, a source node intends to communi- 
cate with the destination node by passing the message to all re- 
lay nodes. In both settings, the sensor/relay nodes then serve as 
distributed transmitters and seek to convey a common message 
to the intended receiver. To preserve energy in this stage, trans- 
mit beamforming has emerged as a promising scheme due to 
its potential array gain and low-complexity. However, perfect 
channel state information (CSI) at the transmitter is required 
by conventional transmit beamforming schemes to generate 
beamforming coefficients and achieve phase alignment at the 
receiver end. This requirement and the distributed nature of 
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wireless sensor/relay networks make it difficult to implement 
transmit beamforming schemes in practice. Although obtaining 
perfect CSI may be too expensive from a practical point-of- 
view, partial CSI can be made available via a low-rate feedback 
link from the receiver to the transmitters. As a consequence, 
there has been increased interest in designing efficient schemes 
that achieve distributed phase alignment in the presence of 
a low -rate feedback link (TJ, (2|, El, 0. In this work, our 
goal is to provide a framework for systematically analyzing 
the performance of a general set of distributed beamforming 
schemes with such low-rate feedback. 

To illustrate the advantages of our framework, we focus 
on the analysis of a recently proposed training scheme for 
distributed beamforming (TJ, Q. The proposed scheme is 
a simple adaptive algorithm using one bit of feedback in- 
formation, and is attractive in practice since it is simple to 
implement. Naturally, one would expect a tradeoff in energy 
consumption due to possible slow convergence of distributed 
beamforming, but surprisingly, the scheme proposed in HI 
converges rapidly and hence utilizes energy efficiently. The 
scheme adjusts its phases for all sensors simultaneously in 
each time slot to achieve phase alignment. This reduces the 
overhead significantly compared with direct channel estima- 
tion between each source node and the destination node. In 
fact, the convergence time of the scheme scales linearly with 
the number of nodes. 

Although the scheme of |[T| has many desirable features, the 
fundamental reasons behind the effectiveness of the scheme 
are unclear from previous work. In [2], the analyses of the 
convergence and linear scalability of distributed beamforming 
schemes have been based on model approximations, which 
may be loose for some cases. Assuming the stepsize ap- 
proaches zero, stochastic approximation is used in J5] to 
show the convergence of the one-bit scheme in distribution. 
Furthermore, the authors proposed two more algorithms: the 
signed algorithm and the p % solution algorithm and proved 
the convergence of both algorithms via the same technique. 
A discrete version of the problem has been solved in Pfl, ||51l 
by considering a simplified model with a binary channel and 
binary signaling. 

In this work, instead of focusing on the convergence of 
a particular algorithm for a particular function, we seek a 
fundamental understanding into the convergence of distributed 
beamforming schemes more generally by studying them within 
the framework of local random search algorithms. Through 
this framework, we are able to provide a more comprehensive 
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analysis of the fast convergence and linear scalability of the 
scheme proposed in (T). In particular, our analysis does not 
involve approximation of any sort and hence makes statements 
on convergence and linear scalability in Q, (3] more rigorous. 
Our result is also stronger than that in J3] in the sense that 
convergence in probability is proved instead of convergence in 
distribution. Further, we show that due to the special structure 
of the objective function considered in this problem, any adap- 
tive distributed beamforming scheme that can be reformulated 
as a random search algorithm converges in probability. This 
broad set of algorithms also includes the signed and the p % 
solution algorithms proposed in Q and makes our analysis 
more general and rigorous than existing work in the literature. 

We organize the paper as follows: In Section [II] we in- 
troduce the system model and the received signal magnitude 
function, which is used as our metric to measure the beam- 
forming array gain throughout the paper. In Section |lIIJ we 
propose a framework that allows for a systematic analysis of 
a general set of adaptive distributed beamforming schemes. 
Specifically, we reformulate this set of adaptive distributed 
beamforming schemes as random search algorithms via a 
general framework. This reformulation provides insights into 
the necessary condition for the convergence of the scheme 
proposed in flj. These insights lead us to investigate the 
properties of the received signal magnitude function in Section 
IIVI We further use these properties to prove the convergence of 
the local random search algorithm in probability and in mean, 
and provide simulations to validate our analysis. In Section Ivl 
we show that the time required for the algorithm to converge 
in mean scales linearly with the number of nodes. We also 
provide numerical results that validate our analysis. Finally, 
we conclude the paper in Section [VI] and suggest directions 
for future research. 

II. System Setup 



We assume a noncoherent communication model, where the 
realization of the channel is unknown at both the transmitters 
and receiver. There is, however, an error-free, zero-delay feed- 
back link of finite capacity from the receiver to all transmitters 
conveying low-rate partial channel state information (CSI) in 
each time step. 

The goal of distributed beamforming is to pick the beam- 
forming coefficients {gi[t] = b^e^^} to maximize the 
received SNR. In a noncoherent setting and with a low-rate 
feedback link, beamforming can only be achieved adaptively 
through training. Without loss of generality, we assume that the 
signal s[t] is constant during the training stage. Furthermore, 
we make the following two simplifications. First, we assume 
that each transmitter utilizes the same amount of energy for 
each transmission, i.e., that bj\t\ = 1 for all i and t, i.e., we 
do not optimize the beamforming gains, and we therefore set 
s[t] = \[P. This assumption is justified for situations where 
the transmitters rely on a limited energy source (battery) and 
allowing them use different amounts of energy would cause 
some nodes to use up their energy before others. Secondly, 
we assume that the receiver can estimate the magnitude of 
the signal component at the receiver (without the noise term 
w[t] in (fTJl). We therefore use received signal magnitude as 
the metric for optimizing the beamforming phases. 

The received signal magnitude can be expressed as 



Mag(#! [£],••• ,0„ 



= VP 



(2) 



where 9i [t] = fa + ipi [t] is the total received phase for sensor 
i. 

It is easy to see that Mag(-) is maximized when the phases 
{#i[t]} are aligned, i.e., they are equal to each other (modulo 
2tt). Our goal is to study adaptive distributed beamforming 
schemes that achieve this phase alignment through the use of 
a low-rate feedback link from the receiver. 



We consider the problem of distributed beamforming, where 
n s transmitters seek to beamform a common message to 
one receiver in a distributed manner. We assume that each 
transmitter and the receiver is equipped with one antenna, and 
that the channels from the transmitters to the receiver experi- 
ence frequency-flat, slow fading. The discrete-time, complex 
baseband system model over a coherence interval is given by 



y[t} = ^h igi [t]s[t}+w[t] 



(Hb i [t]e i (* i+ + t W>8[t]+w[t] 
(1) 

where s[t] E C is the transmitted common message, y[t] G C is 
the received signal, and w[t] ~ CAf(0, a 2 ) corresponds to the 
additive white Gaussian noise. For transmitter i, we denote the 
channel fading gains by hi = a^e^ <= C and beamforming 
coefficients by g^t] = b^e^^ € C. Note that a t > 0, 
bi[t] > 0, and fa £ [0,2tt], fa[t] G [0,2tt] for all i and t 
since they are the corresponding magnitudes and phases of h{ 
and gi, respectively. Moreover, a, and fa are considered to be 
constant with time over the coherence interval due to the slow 
fading assumption. We assume an average power constraint on 
s[t] given by -E[|s[t]| 2 ] < P for all t. 



III. A Framework for Systematic Analyzing 
Adaptive Distributed Beamforming Schemes 

In this section, we introduce a framework for analyzing 
a general class of adaptive distributed beamforming schemes 
that can be reformulated as random search algorithms. Random 
search algorithms are well studied in the literature @, Q, 
(8) as methods to maximize an unknown function via random 
sampling. Once an adaptive distributed beamforming scheme 
can be successfully reformulated as a random search algo- 
rithm, a systematic study of the convergence of such adaptive 
scheme is possible. 

A. Reformulation of Adaptive Distributed Beamforming 
Schemes as Random Search Algorithms 

Adaptive distributed beamforming algorithms introduced in 
Section [TT] seek to maximize Mag(-) given in (0 with the help 
of a low-rate feedback link. At each step of the adaptation, the 

1 A good estimate of the received signal magnitude can be obtained directly 
when the noise is small, or by averaging over several time slots when the noise 
is not negligible. 
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signal magnitude at the receiver is a sample of the function 
Mag(-) . Thus, from the receiver point of view, the problem 
of distributed phase alignment can be considered under the 
setting of the following problem: 

Problem 1: Given a unknown function f : — > K, C 
R", where only samples of f(6) are available for arbitrary 
9 € 0, find the global maxima of f. 

It is important to note that Problem\J}is a global maximiza- 
tion problem in general if no special structure is assumed for 
the objective function /. To solve the maximization in Problem 
[7J one may be tempted to use gradient-based algorithms that 
are well-developed in the literature. Since it is possible for / 
to possess local maxima, conventional gradient-ascent methods 
would fail in general. Besides, acquiring the gradient of the 
function / may be infeasible especially when the function 
itself is unknown. Hence, random search techniques |6|, Q, 
(8) are more appropriate in this setting and can be described 
as follows: 

A Random Search Algorithm: 

• Step zero: Initialize the algorithm by choosing 6[0) £ 0. 

• Step one: Generate a random perturbation S[t] from the 
sample space (M™, B, /it), where B is a Borel set on 
R™ and [i t is a probability measure that could be time- 
varying. 

• Step two: Update the search point by 0[t] = D(6[t — 
l].S[t]), where the map D satisfies the condition 

f(D(e[t-i],s[t}))>f(0[t-i}). 

Clearly, for a random search algorithm, we require only 
function evaluations and control over the probability measure 
/it, which is used to sample the function. Any adaptive dis- 
tributed beamforming scheme can be reformulated as a random 
search algorithm if each distributed transmitter initializes its 
phase as in Step zero, generates a random perturbation of phase 
as in Step one, and updates its new phase by the map D as 
in Step two. The low-rate feedback link is used to guarantee 
the condition f(D(6[t-l],S[t])) > f(6[t-l]). Note that the 
unknown function / can be any objective function that we find 
fit for the distributed transmitters to optimize. This suggests 
that our framework can be used to analyze a more general 
function optimization problem over distributed networks. Note 
further that the probability measure /it for the sampling can 
be time-varying in general. The time-varying nature of the 
probability measure can be thought of as "adaptive stepsize" 
for distributed algorithms in the most general sense. In this 
sense, our framework can be used to analyze a large set of 
adaptive distributed algorithms. 



B. One-bit Adaptive Distributed Beamforming Scheme 

To illustrate the advantage of our framework, we now 
analyze a one-bit adaptive distributed beamforming scheme 
recently proposed in HI. Specifically, we reformulate this 
scheme as a local random search algorithm, which allows for 
its systematic analysis. We begin by describing the one-bit 
adaptive distributed beamforming scheme as follows: 

A One-bit Adaptive Distributed Beamforming 
Scheme fR : 



Step zero: Referring to (f2l) and noting that the z-th trans- 
mitter controls its beamforming phase tpi [t], the algorithm 
is initialized by setting ipi[0] = 0, and hence 6i[Q] = <pi 
for transmitter i. 

Step one: In this step, a random perturbation Si [t] is gen- 
erated at each distributed transmitter such that {(^[i]}™^ 
are i.i.d. uniform random variables in [— <5o, <5o] across 
time and transmitters, where Sq is a constant parameter. 
The random perturbation is added to the total phase of 
each transmitter. The distributed transmitters then use the 
perturbed total phases as their new total phases to transmit 
the training symbol. 

Step two: After receiving the training symbols, the re- 
ceiver measures the received signal magnitude and com- 
pares it with the signal magnitude received in the previous 
time slot. If the newly received signal magnitude is 
larger, the receiver feeds back a "keep" beacon to the 
transmitters. Otherwise, a "discard" beacon is sent to the 
transmitters. Note that the beacon is a broadcast from 
the receiver to all transmitters. Clearly, this feedback 
scheme only requires one bit of feedback information per 
time step. When a "keep" is received at the transmitters, 
each transmitter selects and keeps its newly updated total 
phase. Otherwise, the old phase is selected and the new 
phase discarded. This selection process is determined by 
whether the random perturbation increases or decreases 
the array gain for the adaptive distributed beamforming 
scheme. Specifically, the evolution of 9[t] is given by 

0[t-l] + 6[t], if S[t] e K 
0[t-l], if S[t] i K 

where 0[t] [0i[t], • • • , 0„. [t]] T , S[t] 

[Si[t],--- ,SnM] T > and £ = {S[t}\\Mag(9[t - 1] + 
6[t]) >Mag(0[i-l])}. 
Matching the steps of the above one-bit adaptive scheme 
and those of a random search algorithm introduced in Sec- 
tion IIII-AI it is clear that the one-bit adaptive distributed 
beamforming algorithm can be regarded as a special case of 
the random search algorithm by setting 

/ = Mag(-) (4) 
n = n s (5) 
= [0,271]™= (6) 

Mi = A* ( 7 ) 
D(0[t-l],8[i\) = 0[t-l] + l {s[t]eK} 8[t] (8) 

where is the indicator function and /i is uniform on 
[— <5o,<5o] n % which is a n s -dimensional hypercube. Note that 
^ is the same as the evolution described by ([3]). 

Since the probability measure /i is non-zero only within 
a hypercube, with sides of length 25q and centered around 
0[t — 1], the one-bit adaptive distributed beamforming scheme 
can be reformulated as a local random search algorithm. We 
emphasize again that we can use this framework to study 
more general adaptive distributed beamforming schemes. For 
example, the probability measure for sampling may be time- 
varying and with a support that spans the entire space 0. We 
can also study adaptive distributed beamforming schemes with 



6[t) = 



(3) 
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more than one bit of feedback information. It is also interesting 
to note the connection between this local random search 
algorithm and simulated annealing [9|. Simulated annealing is 
a generic probabilistic algorithm that approximates the global 
optimal solution of a given function in a large search space. 
The algorithm uses a parameter T called the temperature to 
control the acceptance probability, i.e., the probability that the 
current state of the algorithm transitions to a new state. If we 
let T — > and assume that the current state is only allowed to 
move to neighboring states, the simulated annealing procedure 
reduces to a local random search algorithm. 

A local random search algorithm, however, does not nec- 
essarily converge in general. For example, if the unknown 
function possesses local maxima (that are not global maxima), 
the sequence {#[£]}^ is likely to be trapped in a local 
maximum if the local perturbation 5q is not large enough. 
Thus, a necessary condition for the convergence of local 
random search algorithms for arbitrary Sq is that there is 
no local maximum point for Mag(-). With these in mind, 
two questions arise naturally: a) Does the reformulated local 
random search algorithm even converge? b) If it does, is 
there a fundamental reason behind the convergence? In the 
following section, we investigate properties of the function 
Mag(-) towards the goal of addressing these questions. 

IV. Convergence of the Distributed Beamforming 
Scheme 

A. Properties of Received Signal Magnitude Function 

The properties of the received signal magnitude function 
Mag(-) do not depend on the time evolution of its arguments. 
We hence ignore the time dependence of 9[t] in this section. 
The following proposition states the first property of Mag(-). 

Proposition 1: For the received signal magnitude function 
Mag(-) defined in (|2), all local maxima are global maxima. 

Proof: To facilitate analysis, we introduce a change of 
variables 







cos 9i 






sin 9i 



Eqn. (O can be rewritten as 



Mag(xi, • • • ,x„ s ) = VP 

,n s . The maximization of 



where ||xj|| 2 = 1 for all i = 1, • 
Mag(-) can be rewritten as 



max 

||xi|| 2 =l,i=l,— ,n s 



(9) 



In the following, we will show that all local maxima of this 
objective function correspond to complete phase alignment for 
all transmitters. That is, all local maximum points are global 
maximum points. 

By relaxing the equality constraints to inequality constraints, 
the optimization problem in (O is equivalent to 

2 



max 

2 <l,i=l, 



i=l 



(10) 



This equivalence can be seen as follows: if x* is a local 
maximum with an inactive constraint ||x£|| 2 < 1, by fixing 
all other variables {x*}^^, we obtain 



i=l 



t *R i R\2 

(a k x k +cr^ 



) 2 + Kxl/+ C / ) 2 



where c = [c H c ] is a constant vector depending on {x*}.,^. 
Obviously, the above function can be improved by appropri- 
ately perturbing ||x£|| according to the signs of c R and c 1 . 
This contradicts the fact that x* is a maximum. Thus, all 
constraints are active if x* is a maximum point. This shows 
that the optimization problems (0 and ( fTOb are equivalent. 

Focusing on the optimization problem with relaxed con- 
straints, the Lagrangian of ( fT0T > reads 



L(x,A) = -||w|| 2 +5>(||x 4 || 2 -l) 

i=l 



where x = [xf\ • • • , x^J T , A = [Ai, • • • , A„J T , Aj > for 
all z = 1, • • • ,n s , and w = 2^™=i a i x i- By the Lagrange 
Multiplier Theorem, all local maxima satisfy 



V Xl L(x, A) = -2a i w T + 2A,xf = 1 



i=l 



i) 



1 < o 



(ii) 

(12) 
(13) 



for all i = 1, • • • , n s . Let x* be a local maximum and A* be 
the corresponding Lagrange multipliers. If A* = 0, Eqn. ( TTTb 
implies that w = sincq 2 ] a, > 0. In this case, Mag (x* ) = 
and this contradicts the fact that x* is a local maximum, since 
we can always improve Mag(-) by letting x* = [£ 0] T , £ < 1, 
and Xj = for all j ^ i. This leads to A; > for all i. We 
hence have 



-w 



A* = 



Oillwll 



(14) 
(15) 



The optimal solutions described by (fl4l and (15[ , however, 
also satisfy 



Mag(x*) = VP 



Ew 
"'HI 



i=i 



and hence are global maxima. This completes our proof. 

■ 

Pro/?os/f/on[T]implies that the local random search algorithm 
cannot be trapped in a suboptimal local maximum since all 
local maxima are global maxima. Furthermore, it also suggests 
that the necessary condition for the convergence of random 
search algorithms is satisfied. While it is intuitively clear that 
the local random search algorithm should converge according 
to Proposition [T] it is to be noted that the condition is 
only necessary and may not be sufficient. We will provide a 
rigorous proof of the convergence of the local random search 

2 Note that the case where = is not interesting since we can always 
reduce the dimension of the problem by ignoring x; 
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algorithm later. Now, we explore an additional property of 
Mag(-) that explains the efficiency of the algorithm. 

Another interesting property of Mag(-) is that it is invariant 
under a common phase shift to all transmitters. That is, 



Mag(0 



= VP 



= Vp 



E 

i=l 



J6 C 



»=i 



Mag(0) 



where e is a n s x 1 vector with all elements equal to one, 
and 9 C is a common phase shift that can depend on {6^}™^. 
One possible choice for the common phase shift is to let 
O c (0i,--- ,8 ns ) be such that the imaginary part within the 
modulus function is canceled, i.e., 



Mag(0) 



= Mag(0 - 

n s 

= ^E 



.0n>) 



a,; cos ( 



P^ai cos ^ = Mag(0') 



where 9' = [9[, ■ ■ ■ , 6' n ] T . Note that in the shifted 0' domain, 
the global maxima occur only when ff i = or 2kir for all i, 
where k is any integer. The shift-invariant property results in 
multiple global maxima for the function Mag(-). In fact, all 
global maxima form a one-dimensional "ridge" since if 0* 
is a global maximum, with 0* = 9* + 9 C is also a global 
maximum. This property leads to the rapid convergence of 
the local random search algorithm since converging to any of 
these global maximum points is adequate. 

We conclude this section by summarizing these two impor- 
tant properties of Mag(-) as follows: 

1) all local maxima are global maxima, and 

2) a common shift to its arguments does not change its 
value. 

B. Proof of Convergence 

Intuitively, Property 1 guarantees the convergence of any 
local random search algorithm. To make this precise, we 
introduce an e-convergence region 



R e = {9 g 6 : Mag(0) > Mag (9*) - e} 



(16) 



where 9* is the optimal total phase and satisfies Mag(0*) = 
V~PY^i=\ a i- We define the convergence of a random search 
algorithm in probability as follows: 

Definition 1: A sequence {0[£]}^ o generated by a random 
search algorithm is said to be convergent in probability if, 
given e > 0, 

lim Pr[0[i] G R e ] = 1 

t— >oo 

In other words, Mag(0[f]) converges to Mag(0*) in probabil- 
ity. 

For the proof of convergence, we futher derive a proposition 
stating that for any outside of R e , there is a non-zero prob- 
ability of improving Mag(-) by applying a local perturbation 
to 0. 



Proposition 2: For any given G Q\R e and 8q > 0, there 
correspond 7 > and < rj < 1 such that 

Pr [Mag(0 + 5)- Mag(0) > 7] > 77 

where d is a random vector with i.i.d. elements uniformly 
distributed over [— 5q,^o]- 

Proof: From Proposition \T\ all local maxima are global 
maxima for the function Mag(-). This implies that for all ^ 
R e and all Sq > 0, there exists a point 9 U G Sg and a constant 
7(0) > such that 



Mag(0„) - Mag(0) > 2 7 (0) 



(17) 



where the set Sg is a hypercube of length 2Sq centered around 
given by 

Sg = {ujeG:iv = 9 + 5, S G [-6 ,5o\ n »} 

The continuity of Mag(-) implies that there exists cr(O u ) > 
such that for all £ G T := {lj G : ||o>|| < er(0 M )}, we have 

|Mag(0. u +O-Mag(0 u )| < 7 (0) (18) 

Combining ST% and ( fT8l . we arrive at a lower bound 

Mag(0 u + C) - Mag(0) = Mag(0« + £) - Mag(0 u ) 

+ Mag(0 u ) - Mag(0) 
> - 7 (0) + 2 7 (0) = 7(0) 

Referring to (O for the definition of [i, the above lower bound 
leads to 

Pr [Mag(0 + 8) - Mag(0) > 7(0)] > fi (T) =: 77(0) 

Note that p,(T) is a function of 0, since 9 U is a function of 
0. We complete the proof of the proposition by letting 



7 
'/ 



inf 7(0) 
eee\R e 

inf n(8) 

eee\R e 



Note that the proof of this proposition can easily be gener- 
alized for any local random perturbation 6. Since before the 
sequence reaches the e-convergence region, there is always a 
non-zero probability of improving Mag(-) for each time step, 
the convergence of the sequence is to be expected. A simple 
deterministic analogue is the convergence of a monotonically 
non-decreasing function. The probabilistic nature of the algo- 
rithm complicates the proof. This will become clear in the 
proof of our next theorem. 

Theorem 1: For the function Mag(-) defined in (2|, let 
{0[t]}^ 1 be a sequence generated by the local random search 
algorithm described in Eqn. (@]i-(|8]l. Then the resulting se- 
quence converges in probability, i.e., given e > 0, 

lim Pr[0[f] G Re] = 1 

t— > 00 

Proof: By Proposition |2] we know that given any time t 
Pr [{Mag(0[£ - 1] + 6[t\) - Mag(0[< - 1]) > 7} or {0 G i?J] > fj 
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where fj = min {Pi[0 G R e ], rf\. Since 8 is compact and 
Mag(-) is continuous, there always exists a positive integer 
p such that 

P1 > Mag(0i) - Mag(0 2 ), V0i, 2 € 6 

The probability that the sequence lies in R € after p time steps 
is hence lower bounded by 

Pr[0[p] G iZ,,] > ff 

since {<5[i]}^ are independent across time. This leads to 
Pr [0[p] ^ i? e ] < 1 - ff and 

Pr [6»H G iJe] = 1 - Pr [e\pm] <£ R e ] > 1 - (1 - rf ) m 

for all m = 1, 2, ■ • • . The lower bound is still valid if we let 
the sequence progress £ time steps further, i.e., 

Pr [9[pm + £] G R e ] > 1 - (1 - fj p ) m 

for all m — 1, 2, • • • , f = 0, • • ■ , J>— 1. We complete the proof 
by letting m —> oo. 

■ 

Theorem \T\ states that the local random search algorithm 
in ©-(O converges in probability, and hence also provides 
a proof of convergence for the one-bit adaptive distributed 
beamforming scheme in @. In particular, Theorem \T\ implies 
the convergence of the sequence {Mag(0[i])}^L o in proba- 
bility. Since the sequence is non-negative and monotonically 
non-decreasing, we can conclude that {Mag(0[i])}^ o also 
converges in mean by the Monotone Convergence Theo- 
rem [ 1 1 . Further, by properly generalizing Proposition [2] 
it is straightforward to show that any adaptive distributed 
beamforming scheme that can be reformulated a local random 
search algorithm and seeks to maximize any objective function 
that satisfies Property 1 converges in probability. 
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Fig. I. Evolutions of sequences generated by the adaptive distributed 
beamforming scheme. 

In Fig. Q] we illustrate the evolution of the sequences 
generated by the local random search algorithm from different 
initial points. The initial points are generated randomly from 
a uniform distribution over 9. Only three sample paths of 
the sequence are included in the figure since similar behaviors 



can be observed for other sample paths. For each iteration, the 
random perturbation Si for the ith transmitter is a uniform ran- 
dom variable over [— 6q,So], where 8q = ir/30. Note that we 
use the same channel coefficients to generate these sequences 
since the focus here is on the effect of different initial points. 
In particular, the channel coefficients are randomly generated 
from i.i.d. £A/"(0, 1) in the beginning of the simulation, and 
remain fixed afterwards. 

From the figure, we observe the rapid convergence of the 
local random search algorithm, irrespective of where it is ini- 
tialized. We emphasize again that the fast convergence results 
follow from the two important properties for the function 
Mag(-) as discussed in Section HV-AI Property 1 guarantees 
the convergence of the local search algorithm; Property 2 
results in multiple global maxima for the function Mag(-) and 
hence the fast convergence of the algorithm. The simulations 
provide a partial validation of our proof since we would expect 
the convergence to fail from some initial points if there were 
non-optimal local maxima for Mag(-). It is to be noted that 
the convergence of the local random search algorithm does 
not guarantee that it is the most efficient scheme in terms 
of the number of function evaluations, and hence the most 
efficient scheme in terms of energy. However, the algorithm 
does have a desirable scaling property, i.e., the time required 
for the algorithm to converge in mean scales linearly with 
the number of transmitters. This is the topic of the following 
section. 

V. Scaling Law 

Due to the probabilistic nature of the local random search 
algorithm, we defined convergence in probability in Section 
IIV-BI and showed that the local random search algorithm 
converges. For the analysis of the scaling law, however, we can 
only show convergence in mean, which is defined as follows: 

Definition 2: A sequence {0[t]} generated by a random 
search algorithm is said to converge in mean if there exists 
<at > such that 

e {S[t]}< t=0 \*Mo] [ Ma §(0M)] > Mag(0*)-e = VPX>,-e 

for all t > tjy, where a = [ai, • • • , a„J T . That is, Mag(0[t]) 
converges to Mag(0*) in mean. 

In this section, our goal is to find the time required for the 
local random search algorithm to converge in mean, starting 
from any initial point. In other words, we are interested in 
finding the hitting timJ^ of the random search algorithm, 
and determining its behavior as a function of the number of 
transmitters. Specifically, we derive an upper bound on the 
hitting time of the local random search algorithm as a function 
of n s . Note that the study of the hitting time makes sense 
only if the sequence indeed converges in mean, which we 
established in Section HV-BI 

3 The hitting time in this work is defined as the time required for the 
algorithm to converge in mean. 
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To facilitate analysis, we define the increment function of 
Mag(-) at time r as 

I[r] = [Mag(0[r])-Mag(0[r-1])] + 

= [Mag(0[r - 1] + S[t]) - Mag(0[r - 1]] + (19) 

where [x} + = max(x, 0). We then rewrite the received signal 
magnitude function at any given time kon s as 

kon s kon s 

Mag (0[AonJ) - £ I[t] + Mag (0[O]) =: £ /[r] + c 

T=l T=l 

(20) 

where fco is a positive integer and Co > 0. 

From Proposition [2] we have that for any given r such that 
8[t — 1] ^ R e and any local random perturbation S[r], there 
correspond 7 > and < 77 < 1 such that 

Pr [Mag(0[r - 1] + S[t]) - Mag(0[r - 1]) > 7] > 77 

Thus, we have 

■E«[ T ]| a ,e[T-i] [I[t\] 

> 7P1- [Mag(0[r - 1] + S[t}) - Mag(0[r - 1]) > 7] 

> 777 > 

for any r such that 0[t — 1] g R e . Referring to (|T9b-(f20T>. we 
obtain 



E 



{5[T]}%™n a M[o] [Mag(0[fc o n s ])] 

kon s 



^2 ^Mla^Ir-i] [I[r]] + c > k n s <yr] + c > n/P^i 



where the last inequality follows by choosing fco = 
v ^ m ^' {Qi} ■ This implies that the hitting time for the local 
random search algorithm is at most fcon s , from any initial 
point. Hence, the hitting time for the algorithm scales linearly 
with the number of transmitters. 




Fig. 2. Hitting time for the adaptive distributed beamforming scheme with 
different values of a. 



In our simulations, we say that the sequence converges 
to the a fraction of the global maxima if Mag(0[t]) > 
aMag(0*). We assume that channel coefficients are i.i.d. 
complex Gaussian variables CAf(0, 1), and use the origin as 




Fig. 3. Average convergence time for the adaptive distributed beamforming 
scheme with different values of a. 



our initial point. We set So = 7r/90 for all our simulations. 
Fig. |2 demonstrates the hitting time required for the adaptive 
distributed beamforming scheme to converge in a relative 
sense when a = 0.5, 0.7, and 0.9. It is clear that the 
hitting time increases as a increases. The scaling law for 
the hitting time with respect to n s , however, is the same 
for all values of a. Indeed, we observe linear scaling for all 
values of a. This observation confirms our theoretical analysis. 
Fig. [3] shows the average convergence time for the adaptive 
distributed beamforming scheme to within a fraction of the 
globally maximum value aMag(0*), for different values of 
a. It is important to note the difference between the hitting 
time and the average convergence time. Since our algorithm 
is probabilistic in nature, the convergence time is essentially 
a random variable and each run of the algorithm provides 
a sample for this random variable. Fixing the number of 
transmitters n s , we obtain the average convergence time by 
averaging over a hundred samples of this random variable, 
while the hitting time is obtained by comparing .E[Mag(0[t])] 
with aMag(0*). From Fig. [3] we observe the same linear 
scaling behavior for the average convergence time. We expect 
this property for the average convergence time can be shown 
in a similar manner. 

VI. Concluding Remarks and Future Work 

In this work, we have proposed a framework that allows 
for a systematic analysis of adaptive distributed beamforming 
schemes in sensor/relay networks. We used this framework to 
study the convergence and scaling law of a recently proposed 
one-bit adaptive distributed beamforming scheme [lj. We first 
reformulated the one-bit adaptive scheme as a local random 
search algorithm. This reformulation provided insights into the 
convergence of the one-bit adaptive scheme, and led us to 
investigate the fundamental properties for the received signal 
magnitude function Mag(-). We identified two important prop- 
erties of the function that contribute to the rapid convergence 
of the algorithm. First, all local maxima are global maxima. 
This prevents any local random search algorithm from being 
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trapped in non-optimal local maximum points. Secondly, the 
Mag(-) function is invariant under a common shift to its 
arguments. This property results in multiple global maximum 
points for Mag(-) and hence the rapid convergence of the 
algorithm. Based on these properties, we have shown the 
convergence of the algorithm, both in probability and in mean. 
We further provided an upper bound on the hitting time of 
the algorithm, and demonstrated that the hitting time scales 
linearly with the number of sensor/relay nodes. This linear 
scaling is desirable, especially when the network is densely 
populated. We have also provided simulations that validate 
our analysis. 

It is important to note that the effectiveness of the one-bit 
adaptive distributed beamforming scheme depends critically 
on the properties of the function Mag(-). Maximizing Mag(-) 
is equivalent to maximizing the received SNR if there is no 
error in obtaining the common message, which is true in 
the training stage since the common message is simply fixed 
and known to the receiver. On the other hand if adaptation 
is being performed blindly (without training) it would be 
necessary to consider the possibility of errors in common 
message. The corresponding objective function may then not 
possess the same desirable properties as Mag(-), e.g., the 
objective function may possess local maxima that are not 
global maxima. Much work needs to be done to understand 
how our results can be applied in this more complicated 
scenario. One thing that is clear, however, is that we will need 
to develop new algorithms that exploit the global structure 
of the new objective function since local algorithms can be 
trapped in local maxima. Our general framework for studying 
adaptive beamforming algorithms is even more useful in this 
context since it connects the problem to a well-studied field 
of global optimization algorithms. 
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